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ABSTRACT 



An important problem arising from the increased sharing of information across 
networks is bandwidth constraint. Then limitations of communications channels in the 
transmission of volumous information is the singular bottleneck dictating processing 
capability and robustness of current and future distributed systems. Bandwidth utilization 
with the goal of optimizing the actual information transmitted, has to date, been ignored. 
Many of the current network strategies, both commercial, and tactical, rely on the 
repeated broadcast of a standardized message. As a result, much available bandwidth is 
wasted. The specific approach taken to maximize specific network node throughput on a 
digital network is a three-layer paradigm, managed by an embedded autonomous 
software agent located at each network node. The first layer consists of a network 
specific strategy for reducing the message content. The second layer is a frame by frame 
analysis of the reduced message content, to determine the best compression method to be 
applied to the information itself (MPEG, etc.). Finally a packaging strategy is utilized to 
ma ximi ze the compressed content for each specific network packet. The first phase of a 
proof of concept prototype has been implemented. Initial results, via a network 
simulation, have demonstrated a quantative 300% plus increase in effective information 
throughput capability, utilizing the same bandwidth. Since this approach is an embedded 
technique, existing network hardware, software, and standards remain uneffected. A side 
benefit witnessed is increased network responsiveness, due to increased information flow 
in a timely manner. In terms of processing time required, the cost is more than 
compensated for by increased network efficiency. The net result is a more efficient and 
responsive network capability. Future efforts will implement the entire node management 
capability described in this thesis. It is anticipated this capability will be introduced to the 
opporational fleet within the next five to seven years. 
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I. 



INTRODUCTION 



A. GENERAL 

The advent of the computer age has brought about a plenitude of benefits to the human 
race. Included with these benefits has been the ever increasing demand to transfer exponentially 
increasing amounts of information, and the associated problems of information sharing. The 
focus of this thesis, associated Naval Science Assistance Program (NSAP), and Office of Naval 
Research (ONR) funded research effort, has been to best utilize available digital communications 
assets in the radio frequency (RF) spectrum to allow sufficient transfer of information providing 
DOD assets flexible, rapid, and in-flight reprogramming, re-planning of strike and cruise missile 
assets, to engage a high value, emergent target, in the shortest possible time. The postulated 
methods of utilizing autonomous agents to manage information flow across network nodes has 
applicability to all digital networks. 

Based upon the pioneering work of Pattie Maes, at Massachusetts Institute of Technology 
(MIT) (4, 17, 31, 33, 53, 54), and previous examination of communications node management 
(37, 38), the implementation of independent processes, working on behalf of a host system, to 
optimize the effective meaningful throughput on a communications channel is not only 
desirable, but necessary. The evolution of semi intelligent software, whether called Artificial 
Intelligence, Intelligent Agents, or Autonomous Agents, has reached a level of sophistication 
allowing the insertion of meaningful articulated processes within existing, and future systems 
maximize the network efficiency systematically. Recent work by Michael Cohen on Sodabots 
(8), and the evolution of user interactive TinyMUDs (Multiple User Dimension) of the Maas- 
Neotek family (4), a virtual type personality environment, has demonstrated the ability of 
software to deal with dynamic and changing conditions. The additional, and exponential increase 
in micro-processor power has, for the first time, made available the hardware for such agent 
implementations as compact, self contained, embedded systems, in direct support of larger 
existing systems. 
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B. 



PROBLEM STATEMENT 



At present, tactical command and control networks, such as Link 16, do not employ data 
management techniques to maximize the utilization of the RF spectrum. With respect to the 
surveillance Net Participation Group (NPG), information updates on platform tracks are 
transmitted in full and at a high rate. This is true even when only a single data field (e.g., the 
location) of the track information has changed. The rapid repetition of redundant information is 
a characteristic of other NPGs as well. One of the reasons for this repetition is that a listener, if 
he misses a message, can simply ignore it — another chance to receive the information will soon 
follow. An important drawback of this passive design, however, is that the bandwidth needed to 
support RTR requirements will not be available. 

It is assumed that no new tactical networks (hereinafter referred to as “links”) will be 
available for the foreseeable future, and certainly not in time to meet the needs of the RTR 
Cruise Missile demonstration and the RTR program that supports it. 

Therefore every effort must be made to obtain more efficient use of bandwidth from 
existing links. There are five methods, independent of efforts to handle network entry, and 
passive synchronization, that this effort will explore: Atomic Data Element Transmission, 
Update Bundling, Extrapolation-Driven Updates, Traditional Compression, and Active Network 
Management. In FY96, message size reduction was demonstrated on a closed loop link 
simulation, built upon existing assets. These initial achievement are leading toward benchmark 
demonstrations utilizing NRaD’s Systems Integration Facility (SIF) to establish best-case 
effectiveness. In later years, benchmarks will be established for additional techniques. 

The ultimate aim is to increase the available bandwidth, through efficient and more 
formal management of existing tactical networks, without changing network operation rules or 
message integrity assurance. This capability will result in a new way of handling information on 
existing links, with the possibility of extending capability to a new message specification while 
coexisting within the given operational network. 

As demonstrated by the news presented every day, reactionism is ever increasing in 
today’s complicated world. The demise of the former Soviet Union has eased the potential for 
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led exchanged by responsible parties, but has dramatically increased the potential of 
utilities with the slightest trigger event leading to potential larger exchanges, and loss of life. 

A small trigger event 




can lead to aggression 




leading to a response 




leading to escalation 




targets.mov 



leading to a larger response 




all within a very short period of time. One key point to bear in mind, even the smallest have not 
organizations, or countries have access to weapons of mass destruction. The rational mindset 
does not necessarily apply here, therefor the need for rapid response dictating efficient 
communications throughput capacity. 




II. RISK ASSESSMENT FOUNDATION FOR DESIGN PRINCIPLES 



A. INTRODUCTION 

This document describes the engineering design of an autonomous or knowledge-based 
agent designed to increase effective throughput on tactical links. The agent itself is called 
“Agent,” and each instance of the agent is identical on all nodes in the network. The sole 
assumption made by the architecture is that each node on the network will have an agent 
associated with it able to intercept and manipulate both incoming and outgoing messages. 

The architecture is knowledge-based in all other respects. Thus it is not specific to any 
particular tactical link or digital network. To operate on some other network requires 
modifications to the knowledge bases and facts, but no modifications to the agent itself. 

This document is not intended to function as a tutorial for knowledge-based systems 
concepts. It assumes a high level of sophistication in traditional concepts and addresses where 
the engineering design departs from them. Neither is this document intended to be a tutorial 
about Link- 16 or other tactical communications networks. 

The principal accomplishment was the design of an inference engine specifically tailored 
for intelligent agents. The special features of this inference engine are described in depth so that 
detailed designers can know what to do. 

Each process in the system is controlled by meta rules and a special meta controller that 
is part of the agent run-time environment. Although an elaborate meta language was designed, 
adding sophistication to the inference engine reduced reliance on sophisticated meta rules. Very 
few functions are involved at present. 

The run-time environment for the agent was not designed. The assumption was that one 
or more commercial agent-building tool kits, such as IBM’s ABE (Agent Building Environment) 
would be available to initialize and run the agent. Since these tool kits can be modified to use a 
custom inference engine, and because the facts and rules used by the Agent system can in 
general be translated to and from KIF, the assumption that a tool kit would be available at the 
time of detailed design seems reasonable to the author. However, there is nothing in this system 
that requires the use of such a tool kit. The run-time mechanisms necessary to control the 
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processes and inferencing are not complicated. This supports an implementation structure in 
which the agent can run on its own machine inserted at each node between the node and the 
network. 

This document assumes Link- 16 concepts but the methods of autonomous agents are not 
specific to Link- 16. 

B. PROBLEM STATEMENT 

At present, tactical command and control networks, such as Link 1 6, do not employ data 
management techniques to maximize the utilization of the RF spectrum. With respect to the 
surveillance Net Participation Group (NPG), information updates on platform tracks are 
transmitted in full and at a high rate. This is true even when only a single data field (e.g., the 
location) of the track information has changed. The rapid repetition of redundant information is 
a characteristic of other NPGs as well. One of the reasons for this repetition is that a listener, if 
he misses a message, can simply ignore it — another chance to receive the information will soon 
follow. An important drawback of this passive design, however, is that the bandwidth needed to 
support information exchange requirements will not be available. 

It is assumed that no new tactical networks (hereinafter referred to as “links”) will be 
available for the foreseeable future. 

Therefore every effort must be made to obtain more efficient use of bandwidth from 
existing links. There are five methods, independent of efforts to handle network entry, and 
passive synchronization, that this effort will explore: Atomic Data Element Transmission, 
Update Bundling, Extrapolation-Driven Updates, Traditional Compression, and Active Network 
Management. In FY96, message size reduction was demonstrated on a closed loop link 
simulation, built upon existing assets. These initial achievements are leading toward benchmark 
demonstrations utilizing Naval Research Development Test & Evaluation (NRaD) Systems 
Integration Facility (SIF) to establish best-case effectiveness. In later years, benchmarks will be 
established for additional techniques. 

The ultimate aim is to increase the available bandwidth, through efficient and more 
formal management of existing tactical networks, without changing network operation rules or 
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message integrity assurance. This capability will result in a new way of handling information on 
existing links, with the possibility of extending capability to a new message specification while 
coexisting within the given operational network. 



C. GLOBAL REAL TIME RETARGETING REQUIREMENT 



The basis for this thesis was guided by sponsor desires to support rapid mission re- 
allocation of assets in support of emergent threats on a real time basis. FY96 Program Execution 
Plan for Real-Time Retargeting (RTR) provides a good list of the general requirements for real- 
time retargeting, and basis by which the concept and development of agents is being 
accomplished: 

The ONR Real-Time Retargeting (RTR) Accelerated Capability Initiative (ACI) is aimed 
at providing the cruise missile and tactical aircraft warfighters a new capability to 
redirect their strike forces in real-time to respond to current dynamics of the battlefield. 

The focus of the ACI is on time-critical targets. To engage such targets, these 
warfighters must plan/replan, control, and execute strikes in 5 to 30 minutes, instead of 
the military’s current operational capability of many hours. Current shortfalls that inhibit 
these warfighters from responding in real-time are: (1) lack of automated real-time 
mission (re)planning capability, (2) lack of communications availability and capacity, (3) 
lack of automated target detection and recognition, (4) lack of rapid mission management 
and execution decision aids, (5) lack of real-time terminal targeting information 
dissemination between surveillance assets and strike platforms, and potentially between 
the strike platforms themselves, and (6) lack of necessary available bandwidth to support 
robust rapid data transmission in support of RTR. This equates into a lack of real-time 
threat awareness on board the strike platform. 

Shortfalls two and six are addressed by this effort. Without requiring a change to the net 
protocols or integrity assurance, this effort is developing methods for extending the effective 
bandwidth of existing tactical networks. By utilizing algorithms and system structures which are 
typically not associated with throughput optimization, substantial improvements can be realized 
for extending the communications capabilities of existing networks. The technologies developed 
by this research will support the capabilities of Tomahawk Block IV+ or TSTAR. It will also 
apply to any digital network through optimization of available bandwidth. This effort is not a 
cure-all for network bottlenecks, but rather a best use of what is available. 
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D. 



SCIENCE AND TECHNOLOGY SHORTFALL 



Link 16 is predominately a passive system. This means that messages, such as track 
updates, are transmitted so frequently that they need not be acknowledged by receiving 
platforms. Therefore, if a platform misses a message, it need only wait a few seconds for an 
update. The goal of the design within the surveillance NPG is to provide a complete tactical 
picture in twelve seconds or less, for as many as two thousand tracks. Although the total load on 
the link by the surveillance NPG is not as great as that produced by voice communications or 
video conferencing, it is significant nevertheless. 

In order to adapt the link to RTR requirements, both transmitting and receiving platforms 
need to be smarter, reducing both the level of redundancy in the messages and the frequency 
with which they are transmitted. The idea is to insert processor capability between the C2P and 
the JTIDS terminal on all platforms to distribute network control and manage message content in 
a smarter way. 1 Because of the complexity of the rules of the network and the communications 
between C2P and JTIDS, an agent based architecture is the ideal candidate: it is designed to 
operate transparently, can take action on its own volition, is extensible and flexible, and is 
capable of being included, and / or integrated, in future system upgrades, and will handle the 
complexity of network entry / passive synchronization, while maintaining maximized throughput 
capability of only “changed” information. Distributed control architectures in an object-oriented 
environment can produce smart and complex behaviors that are not generally achievable with 
traditional techniques. 



1 The eventual target for this work is probably the CDLMS effort (Common Data Link Management System), 
which will at some point produce a system replacing the current C2P. 
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E. TECHNICAL APPROACH 



1. General Discussion 

Depending upon available resources, the approach to the FY96 tasks was to establish 
performance objectives and baselines. Sanitized data from actual network operations were 
acquired from the Systems Integration Facility (SIF). This data established baseline for 
evaluating redundancies in network communications. From this data, a decrease in loading is 
expected to result from the technique(s) being developed by this effort. Additional candidate 
techniques are under development and will be tested separately. 

The software written to date has employed reusable, object-oriented methods/techniques. 
A need for a real-time object-oriented data base management system (OODBMS) is anticipated 
as system capabilities are developed. The system architecture is designed to incorporate such 
tools as they are needed and can be exploited. Also, a PMW 159 sponsored effort called the 
Common Data Link Management System (CDLMS), will result in an object-oriented tactical 
communication system. Thus, employing an object oriented style will reduce costs for future 
integration efforts. 

A number of commercial OODBMS’s, such as ObjectStore, Versant, ONTOS, and 
Polyhedra, are under consideration and are to be evaluated as the need arises. Their ability to 
satisfy real-time demands of link messaging, is considered a critical feature. The ability to 
manage storage and retrieval of temporal data, is also a critical factor. 

Finally, objects are by definition, independently executing processes that are designed for 
maintainability and reuse. From their hierarchical orientation, the levels of abstraction in object 
oriented systems are more explicit and straightforward than traditional styles. As the initial 
exploratory efforts are performed, and lower level object methods are created, developing more 
abstract levels of processing is relatively fast, reliable, and adaptable toward new capabilities. 
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F. 



ASSUMPTIONS 



The following assumptions are made with respect to the proposed modifications 
suggested herein: 

•Any modifications made between two C2P / JTIDS terminal linkages can be 
accomplished, as long as the rules for the particular network in question are followed. 

•Messages will have specific data elements recognizable to the C2P. As for transmission 
methods are concerned, data is data, and can be manipulated as desired, so long as the 
reconstituted message elements follow the assumption above and do not violate any link 
integrity requirements. 

•Link rules enforce standards. As long as the -end results obey link rules, the proposed 
modifications can be performed, given timeliness and robustness are maintained. 

•Rules evolve as systems evolve, tactics change, and lessons are learned. Software 
designed under this effort is flexible, allowing changes in rules without forcing system redesign. 

G. AGENTS FOR TRANSPARENT OPTIMIZATION OF 
INFORMATION TRANSMISSION 

An “agent” is a relatively-independent software entity, or collection of such entities, that 
performs tasks on behalf of human users but also can perform on behalf of other computer 
applications or systems. There are two important features of software agents. First, they are 
anthropomorphic processes. As much as possible, agents behave and communicate in ways that 
are similar to and compatible with human behavior and communication. As such, they are not 
necessarily “smart” or “intelligent,” but like humans, they tend to be goal driven. 

Resource management is a common task for software agents. In this capacity, they 
operate at a higher level of abstraction than the user or system they oversee. Often they are 
separate processes initiated prior to the initiation of the system they oversee. Like human 
managers, they are able to “watch” user or system operations, detect problems, formulate 
corrective plans, and carry them out. Some can study the effects of their interventions and 
modify their behavior accordingly. 
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Distributed set of intelligent agents may provide the means and method for overseeing 
the operations of an operational network such as Link- 16. Present at each platform, observing 
transmissions, received messages, and user interactions, agents can be designed to optimize link 
performance and enhance operator efficiency. 

In late 1996, a demonstration capability was developed and an initial evaluation of 
network messages was performed. In 1997, an initial design of a software agent, hereinafter 
called “Agent,” was prototyped (proof of concept), while longer-range architectural ideas are 
considered. In the out years, the final agent architecture will: 

•Accommodate the differing needs of different platforms without recoding; 

•Understand the rules of the links so that all its actions will conform to them; 

•Be extensible, either directly or through cloning with different knowledge bases, 
to other messages and functions; 

When a message is sent from the C2P to the JTIDS, the agent determines, by checking its 
type, whether or not it is of interest. If it is not, the message is simply reasserted on the outgoing 
bus. 

The design goal for Agent architectures is to be able to accommodate future functional 
requirements by adding additional knowledge, leaving the engines and interpreters largely intact. 
The result is a system that can be extended at minimal cost. 

The key design issue is real-time performance. In general, the higher the level of 
abstraction, the higher the overhead. Therefore, particular implementations, such as Link 16, are 
driven by the time constraints of the links, available system resources, and scalability of the 
agent-based architecture. Fortunately, agent-based architectures for Intelligent Real-Time 
Systems and Real-Time Expert Systems, have addressed similar hard real-time constraints. This 
work draws upon those results. 
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H. LINK 16 GENERAL DESCRIPTION 



1. Link-16 Functional Description 

Link- 16, or TADIL J, uses the Joint Tactical Information Distribution System. JTIDS 
refers to the communication component of Link- 16 that encompasses the software, hardware, RF 
equipment and the waveform that they generate. Among NATO subscribers, the equivalent term 
for JTIDS is the Multifunctional Information Distribution System (MIDS). Link- 16 employs 
netted communication techniques and a standard message format for exchanging digital 
information among airborne, land-based and shipboard tactical data systems. Link- 16 does not 
significantly change the basic concepts of tactical data link information exchange supported for 
many years by Link- 11 and Link-4A. Link- 16 provides technical and operational improvements 
to existing tactical data link capabilities. The improvements include nodelessness; electronic 
countermeasure (ECM) resistance; flexibility of communication operations; separate 
transmission and data security; increased number of participants; increased data capacity; 
network navigation features; and secure voice. Link- 16 also uses a Time Division Multiple 
Access (TDMA) architecture, which provides multiple and simultaneous communication nets. 

2. Link-16 Operation 

Link- 16 operates at L-band frequencies (969 - 1206 MHz) with 3 MHz channel spacing 
and a power output of 200 - 1260. It transmits a frequency-hopping transmission pattern using 
51 frequencies at 3 MHz intervals, excluding identification friend or foe (IFF) sub-bands. 
Frequency-hopping and other spread-spectrum techniques make Link- 16 resistant to jamming 
and data encryption makes it secure. The Link- 16 message standard consists of one or more 70- 
bit words plus, error detection, correction bits, and symbols. The word formats are used to allow 
a single message to convey position, track data, weapons control and command messages. 
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3. 



Link-16 Configuration 



The major components of the Navy shipboard Link- 16 system include the following: 
Tactical Data System (TDS), Command and Control Processor (C2P), JTIDS terminal and 
JTIDS antennas. The TDS and C2P provide the tactical data to be exchanged. The JTIDS 
terminal and antennas provide the secure, anti-jam, increased capacity waveform. The JTIDS 
terminal is composed of two major components: the receiver/transmitter (R/T) and the data 
processor group (DPG). The R/T is common to all platforms. The DPG contains a digital data 
processor and an interface unit (IU). The IU is tailored specifically to each type of platform. 

There are two configurations of Link- 16, known as Model 4 and Model 5. The Model 4 
implementation of Link-16-also referred to as Block 0 on Advanced Combat Direction System 
(ACDS) platforms-was designed as a transparent equipment upgrade to existing ship’s tactical 
systems. Model 5-also referred to as Block 1 on ACDS platforms-is the full and complete 
implementation of Link- 16 in accordance with OS5161. 




Figure 1 
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Figure 2 



Tactical 




Figure 3 



I. AGENT-BASED TECHNIQUES FOR REAL-TIME SYSTEMS 



1. Introduction 



The following summary is from Huang [63]: 

George Saradis introduces a three-level hierarchy [Saradis, 1985]. Acar and Ozguner 
present an alternative architecture that organizes the system into a multiresolution 
hierarchy by identifying its components and examining the physical relationships among 
them [Acer and Ozguner, 1990], Antsaklis and Passino describe a hierarchical control 
architecture that uses a hybrid approach to model systems with a high degree of 
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autonomy [Antsaklis and Passino, 1993], Successive delegation of duties from higher to 
lower levels is one of this hierarchy’s important characteristics. In another chapter of 
Antsaklis and Passino, Meystel describes a nested hierarchical control theory that 
includes the concept of treating design and control as a continuum [Meystel, 1993]. 

Regarding implementation of intelligent control systems, research has focused on 
software technologies and computer-aided software-engineering environments. Sweet 
and his colleages identify key software technologies for the Aerospace Industries 
Association [Sweet et al, 1989]. Simmons describes a Task Control Architecture 
[Simmons 1990]. One limitation of TCA might be scalability because it is not intended 
to model multiple-cooperating agents. Object-oriented paradigms are becoming popular 
for handling the representation problems of software systems, but they are not suitable 
for all problems. For example, Schneider and his colleagues developed a flexible, object- 
oriented, real-time software implementation tool called ControlShell [Schneider et al., 
1994]. But this tool does not address the issue of architecture; perhaps a reference-model 
architecture could complement its capabilities. 



Huang is at the National Institute of Standards and Technology (NIST) where, for over 
two decades, Jim Albus has been developing the Real-Time Control System (RCS) reference- 
model architecture. A reference-model does not describe how system structures are represented 
and implemented, but rather seek to conceptualize the high level functionality and process 
interdependency. Object-oriented (00) paradigms have become popular because they can 
reflect the inherent structure of reference models. Given a reference model or system 
architecture, the actual implementation, as driven by the model or architecture, is typically a 
separate design effort. Intelligent real-time system architectures are discussed in more detail in 
the next section. 



2. Intelligent Real-Time Systems (Robotics) 

An approach and methodology for engineering intelligent real-time systems is discussed 
by Durham [44, 45, 46], In terms of systems architectures, Durham considers intelligent real- 
time systems functionally equivalent to intelligent robot systems. The two types of systems share 
similar requirements and objectives, and only differ in terms of the “plant” or environment for 
which they interact. When a robotic agent interacts with a subset of a system such that its “plant” 
is restricted to system software, such agents have been called “softbots,” i.e. software robots. 
Softbot applications do not typically have hard real-time requirements and the real-time 
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limitations of softbots is an open issue. To address such issues, this effort capitalizes on the 
recent advances in autonomous systems and telerobotics. 

Figure 4, below is a diagram of a generic intelligent system architecture derived from 
some early results in the ARPA Autonomous Land Vehicle (ALV) program. Figure 5 is from 
Meystel’s recent work in “Semiotic” system architectures [59]. These diagrams illustrate the 
generic high level architecture of real-time intelligent systems. 




Figure 4 




Figure 5 

Figures 6 and 7 are two diagrams from an intelligent system architecture as described by 
Valavanis and Saridis [60]. These diagrams further illustrate a functional organization and 
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layering. The highest level processes are similar to the executive level within a business 
organization. Longer term organizational structures, such as strategies and mission statements, 
are the types of objects and data structures to be processed. The coordination of day-to-day 
operations is a lower level management function. Specification and design of specific tasks is a 
critical “coordinator” function. Finally, the labor of humans and machines actually execute the 
operations of the organizational entity. A “topology” of organizational structure has emerged 
from previous efforts in real-time intelligent systems. 
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These diagrams illustrate an anthropomorphic model for intelligent real-time systems 
design. This type of system architecture exhibits a number of properties. First, a hierarchy of 
processes is defined. The “higher level” processes require more “abstract” processing such as 
symbol creation and manipulation. “Lower level” processes directly interact with the 
environment. For Agents, the environment is the message data stream. Secondly, timing 
requirements tend to be inversely proportional to the level of a process. Low level processes 
need to be highly responsive. High level processes are often very slow to respond. This layering 
of timing requirements allows a system to maintain sufficient interaction with its environment 
while working to improve its capabilities. The “scope” of systems processes tends to be layered 
in a similar manner. Higher level processes incorporate system-wide and long range data 
elements, i.e. they process the “big” picture. Low level processes have well defined and narrow 
tasks tailored to specific increments of time. This leads to a third characteristic of intelligent 
real-time systems. They are structured such that an “abstract” “high level” goal or set of goals 
can be designed and understood without explicit attention to the numerous run-time or 
operational details. The system is designed to map the high level goals into a set of lower levels 
functions necessary to achieve the given goal(s). Lower level processes, such as control loops, 
have their own low level goals, such as set points. The specific low level processes performed at 
specific points in time are determined by, or at least influenced by, higher level processes. 

Figure 8 is also from Valavanis and Saridis [60]. This diagram illustrates that intelligent 
control is the intersection of three separate areas of work. The requirements and objectives of 
Intelligent Control inherently overlap with Artificial Intelligence, Operations Research, and 
Control Theory. Each one of these disciplines in turn interact with each other and define a 
number of specialized subdisciplines. From this organizational perspective, a systems designer 
can recognize that, if properly managed, a considerable amount of knowledge and talent can be 
utilized for a real-time system application such as throughput optimization of communications 
networks. The development and demonstration of this technology insertion capability is central 
to the Agent concept. A mechanism for efficiently and effectively inserting new capabilities into 
operational legacy systems is a primary objective and focal concern. 
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3. Real-Time Expert Systems 

Expert systems are not typically real-time systems in terms of their inherent architecture. 
The heuristic search algorithms typically employed are not guaranteed to meet hard real-time 
requirements. A number of intelligent control systems employ “Expert System” algorithms and 
perform within their requirements. Expert systems have become an integral component of 
intelligent systems, but their application tends to be best for “organizational” processes. Fuzzy 
logic expert system techniques have been recently demonstrated and they may prove to be quite 
valuable at the coordination and execution level of the Agent processes [61]. 
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Figure 9 
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J. 



OTHER CRITICAL TECHNOLOGIES 



1. Information Theory: Mathematics of Communication 



a. Definition of Information 



The following is from Hamming [49]; 

Suppose that we have the source alphabet of q symbols s,, s 2 , ... , s q , each with its 
probability p(s,) = p,, p(s 2 ) = p 2 , ... , p(s q ) = p q . When we receive one of these symbols, 
how much information do we get ? For example, if p, = 1 (and, of course, all the other p, 

= 0), then there is no “surprise,” no information, since you know what the message must 
be. On the other hand, if the probabilities are all very different, then when a symbol with 
low probability arrives, you feel more surprised, get more information, than when a 
symbol with a higher probability arrives. Thus information is somewhat inversely related 
to the probability of occurrence. 

From an information theoretic perspective, “surprise” is associated with the probability 
of a communication event. Data can be transmitted but unless there is a lack of predictability, 
there is no surprise and therefore no information content. Agent based communication links 
capitalize on this understanding of information content. There are a number of ways that the 
degree of “surprise” can be taken out of network messages. Traditional data compression 
techniques capitalize on a relatively small number of possible approaches. Traditional data 
compression, and coding theory in general, will be exploited and incorporated within the Agent 
architecture, but this is not the technology being developed. 

Agent based communications should not be confused with data compression 
technologies. Through efficient management and accounting of what is known, or can be known 
before messages are transmitted, agent based communication links share the knowledge in a 
manner such that there is less “surprise” in the messages transmitted. Thus, Agents are able to 
communicate such messages with fewer bits. 

By incorporating engineering and design knowledge of the platform tracks, the behavior 
of the tracks becomes predictable and, thus, less of a surprise from one message to the next. 
Agent based communication also provides a mechanism for incorporating command and control 
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(C2) knowledge. For C2, different track types typically have different operational requirements 
for position resolution and update rate resolution, the operational requirements for actual 
message transmission times and resolutions can be known before messages are transmitted, and 
therefore, the track messages can be communicated at a lower rate with fewer bits per message. 
Also note that redundancy can be added when operationally required for increasing the 
confidence that messages will be received. 

Agents are network processes that have a primary goal of identifying and sharing system 
knowledge that is not shared by the current communication links. One of the assumptions, as 
stated earlier, is that with minimal impact, such a collection of processes can be inserted into an 
existing communication network. With Agents, otherwise unutilized knowledge is then used to 
take the “surprise” out of the messages as they were originally communicated. Agents exploit 
knowledge within the scope and bounds of typical data compression methods, but data 
compression is not the technology under development. Data compression related knowledge is 
considered a relatively small portion of the system wide tactical knowledge available for 
maximizing information throughput. By identifying and accounting for system wide knowledge, 
more robust and effective communication is realized while increasing the effective throughput of 
the tactical network. Agents, by design, are background network processes which continuously 
work to identify and manage tactical network knowledge for achieving their own goal of 
maximizing effective throughput. 

b. Measures of Information , e.g. Entropy 

Claude Shannon's paper in the 40’s established a mathematical basis for the theory of 
communication [56]. Shannon defined a mathematical function, called entropy, for measuring 
information content. Shannon's entropy function is defined as 




22 



where N is the number of symbols communicated, q is the number of possible symbols, and p, is 
the probability that the i - th symbol is in the N length message. As noted in Hamming [49], "It 
is important to realize that a remark like 'Consider the entropy of the source' can have no 
meaning unless a model of the source is included. Your estimate of the entropy of a source of 
symbols therefore depends on the model you adopt of the structure of the symbols.” 

A variety of metrics exist for defining and determining information content. Kapur [51, 
52] provides a number of examples and references. There is no single absolute measure of 
information. Entropy and working model of the source is necessary. A model is needed to 
specify the symbol set and probabilities for the given symbols for all points in time. Usually, the 
probability distributions are assumed to be “stationary.” This means that the statistics of the 
symbols do not change with time. Operationally, nonstationary data is quite common. Thus, 
potential performance of “optimal” data compression schemes such as Huffman coding, cannot 
be realized. Alternative schemes, such as Lempel-Ziv dynamic dictionary based approaches, 
often demonstrate better operational performance. Thus, there is no one ideal technique or 
underlying model for a given coding scheme. 

Ideally, a more abstract “context dependent” network process would dynamically identify 
and select the appropriate measures of information and their most appropriate models. Such 
metrics and models can then be selected based on the degree of compression which can be 
operationally realized at a given point in time. While managing and exploiting other types of 
knowledge, such as “system design knowledge” and “C2 knowledge,” Agent processes can also 
manage “data compression knowledge.” Thus, improved operational performance of traditional 
data compression schemes is inherent in the Agent architecture. 

c. Universal Compression and Retrieval 

The following summary note is from Krichevsky [50]: 

The output of a source may be compressed up to its entropy, not more. It is the main fact 
of the source coding theory, which has its origin in Shannon (1948). The first theoretical 
compressing code is Shannon’s. It happened to be very close to Morse’s, which was 
developed empirically a century earlier. Shannon’s code is very simple, although not 
optimal. An optimal code fora stochastic source was constructed by Huffman (1952). It 
may be used to compress an individual word. The frequencies play the role of 
probabilities. The encoding may be either two-pass or one-pass. In the first case the 
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frequencies of letters are counted beforehand, then a code is built, in the second case the 
code is changed along with the word reading. 

Output words of Shannon or Huffman’s codes are of different lengths. There is a 
compressing code, whose output words are of equal length. It was developed by first G. 

L. Khodak (1969) and then by F. Jelinek and F. Schneider (1972). 

A decipherable code for integers is constructed by V. Levenstein (1968) and P. Elias 
(1975). 

Hansel (1962) and Krichevski (1963) exploited the source coding to lower-bound the 
length of threshold formulas. 

The Kolmogorov complexity bridges the theory of algorithms and the information 
theory. Any code, including Lempel-Ziv’s, move to front, etc., is only a majorant of 
Kolmogorov’s. 

In other words, majorants of Kolmogorov codes provide a theoretical framework for 
compression algorithms. In this context, optimality is tied to computability, as well as, 
compressibility. Computing cost is typically a critical operational parameter, but until recently, it 
has not been an integral theoretical component. 

Krichevsky also notes that “for many years the target of the source coding theory was the 
estimation of the maximal degree of the data compression. This target is practically hit today. 
The sought degree is now known for most of the sources. We believe that the next target must be 
the estimation of the price of approaching that degree. So, we are concerned with the trade-off 
between complexity and quality of coding.” The reader may need to be reminded that in this 
theoretical context, the sources are assumed to be known and well behaved. 

Computing time is a critical factor for agent based communication links but until recently 
it has not been a theoretical focus of concern. Accurate and robust source models are another 
critical operational concern that is often overlooked by information theorists. Krichevsky’s 
analysis assumes that the sources are known and well behaved. Temporal variability is not 
within the context of his work. 

Within its own scope and context, coding theory provides a basis for determining 
compressability. With full knowledge of their utility and limits, compression algorithms can be 
identified and utilized at the most appropriate times. As the characteristics of data sources 
change with different tactical situations, software agents can activate the most appropriate 
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compression algorithms and source models for the particular context. Agent’s can operate on 
knowledge within and beyond the scope of coding theory and data compression. How these 
bodies of knowledge are coordinated is of particular concern for optimizing the operational 
throughput of a network. 

d. Universal Prediction, e.g. Machine Learning, Generalized Approximation, etc. 

Universal prediction is another recent focus in coding theory. If a source is well behaved 
and can be modeled, then the source should become known over time and the sequence of 
symbols should be optimally predictable. Theoretically, if the symbols can be predicted, optimal 
coding schemes should incorporate such predictability. Traditionally, predictability is not 
assumed. For adaptive schemes, lack of predictability is the alternative focus. 

Universal predictive coding schemes are rooted in the same mathematics as related areas, 
such as machine learning, generalized approximation, pattern recognition, classification, and 
adaptive filter theory. Probability and mathematical statistics are typically developed and applied 
using the numerical methods of linear algebra. When possible, these techniques often combine 
empirical and analytical modeling methodologies. Analytical models are preferred over 
empirical “black box” approaches. 

Just as the Krichevsky and others are interested in bridging complexity theory and coding 
theory, those working in the area of universal prediction and interested in bridging the 
mathematics of prediction and coding theory. The Agent approach under development is not 
limited to coding theory, or those areas of coding theory most related to efficient network 
communications. As previously noted, coding theory has its limitations and Agents address those 
limitations while exploiting opportunities which are not within the scope of traditional coding 
theory approaches. 

Coding, complexity, and prediction can each be exploited by a more abstract system 
process. At this level of abstraction, a spectrum of competing data compression algorithms can 
be evaluated. This can be done alongside competing prediction algorithms. For real time 
execution, algorithmic complexity is a critical factor for both coding and prediction. The Agent 
architecture is designed to manage these traditional components, but these plug-in capabilities 
could probably be managed by a number of other more traditional techniques. The ability to 



25 



identify and quickly utilize system wide knowledge, is the focus of agent based communication 
links, i.e. Agents. The agent based architecture provides the mechanism for incorporating 
additional bodies of knowledge for maximizing network throughput. 

Figure 1 1 illustrates the relationship between pattern recognition and data compression. 
In both cases, invariances are of special interest. The goal is to identify functions whereby 
predictable and recognizable patterns can be identified and exploited. A number of “plug-in” 
pattern recognition capabilities have recently become available. Agents are being designed such 
that those capabilities can be managed and exploited. The design objective is to demonstrate 
“coordination” and “organizational” network processes that manage C2, system design, data 
compression, pattern recognition, and other bodies of knowledge for the purpose of maximizing 
the effective throughput of tactical communication networks currently operating in the Fleet. 
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Figure 1 1 

As mentioned earlier, universal predictive coding is inherently related to machine 
learning. Intelligent real-time systems share a similar interest in machine learning. The objective 
for hierarchical intelligent systems is to design systems such that they can learn to create optimal 
tasks and coordinate their execution. The following is from Lima and Saridis [62] and discusses 
work related to this objective: 

A robot can learn from the data provided by its external sensors (such as cameras, ultra- 
sound transducers, or proximity detectors) or internal alarms (such as a battery failure or 
a time-out while a process runs). Often the goal is to leam how to recognize a scene or an 
object in the scene, by either a statistical or a structural approach. The robot explores the 
similarities of the scene or object with previously stored patterns. Some learning 
programs record the strategy used to solve a particular problem so that they don’t have to 
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search for a solution when the problem emerges again [Samuel 1963]. Other programs, 
such as those based on neural networks, begin with a random network and continually 
change the connections (or weights) among network nodes, to reflect a bad or good 
performance as they try to accomplish their job [Hertz 1991]. This later type of learning 
algorithm is usually classified as unsupervised if it receives no information about the 
correctness of its output, and as supervised if the goal is available. We can improve the 
algorithm’s performance by measuring the error between its goal and its actual output. A 
third category is reinforcement learning, where limited information is available about the 
algorithm’s instantaneous performance, typically in the form of success or failure signals. 

Since the late sixties, various strategies based on reinforcement learning have emerged to 
address the control of complex systems. Reinforcement learning is particularly 
interesting for robotics, where it involves the exchange of small bandwidth information 
(failure or success) between robotic subsystems. In typical applications, such as 
unmanned space or underwater missions, the cost of large bandwidth for 
communications between the central command (earth controller or main-vessel 
controller) and the vehicle is prohibitive. Designers can reduce this cost by increasing the 
autonomy of the machine involved in the mission. 

Fu was perhaps the first to write about learning control systems and to define intelligent 
control systems as systems of an interdisciplinary nature where artificial intelligence and 
automatic control intersect [Fu 1986]. Moreover, he introduced the concepts of stochastic 
automata and stochastic grammars. Narendra and his associates have also developed 
work on stochastic automata [Narendra and Thathachar 1989]. In the last few years, 

Sutton and his associates have explored reinforcement learning solutions that associate 
these two views of stochastic automata [Sutton 1988]. 

A frequent limitation of reinforcement learning applications to robot problem’s is the 
problem’s large state space. Lin attempted to tackle this problem by providing initial 
knowledge to the robot and by endowing it with generalization capabilities via neural 
nets [Lin 1994]. Sutton described an algorithm that learns from both virtual experiences 
in an internal-world model and real-world experiences, to accelerate the learning process 
[Sutton 1990]. Both of these authors used Watkins’s Q-learning algorithm, which 
includes a learning-performance function [Watkins and Dayan 1992]. 

Finally, Wang and Saridis used reinforcement learning to improve performance at the 
intermediate level of the hierarchy [Wang and Saridis 1993]. And Mclnroy and Saridis 
proposed reliability as a practical measure of primitive-task performance - but did not 
consider the computational cost [Mclnroy and Saridis 1994]. 

The above discussion illustrates that neural nets are but one technique of machine 
learning. In practice, neural nets are more accurately described as a family of regression 
techniques utilized to generate empirical models. When there is limited communication 
bandwidth between agent processes, e.g. mother ship and vehicle, agents can minimize 
communication requirements. For purposes of the Agent architecture, agents with synchronized 
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models only need to communicate when one of the agents recognize a deviation from the “real 
world,” i.e. part of its environment not shared with the other agent. Furthermore, since the goal 
is to minimize communication requirements between such agents, synchronized learning 
algorithms are another body of knowledge to be exploited. 
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ENGINEERING DESIGN FOR A NODE AGENT 
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Figure 13 

A. CODESIGN: SOFTWARE ENGINEERING FOR REAL-TIME SYSTEMS 

Delays in communication are a primary concern for tactical networks; just as 
transcontinental phone conversations were plagued with annoying delays due to the limitations 
of hardwired signal transmission. The addition of any process in the path of C2 messages is an 
even greater concern. Because Agents map messages and communications to more efficient 
representations and transmission timing, they inherently introduce delays on a per transmission 



29 



basis. If Agent processes are introduces at the 1553B bus, very small latencies may be required. 
This may be a significant constraint. If the Agent processes are designed into the CDLMS, much 
flexibility will be gained, but message management will still delay message communications. 
For any communications network, messages are typically delayed due to typical management 
functions. The critical concern is that such delays be minimal. For time critical data compression 
applications, data compression processors (DCP) are typically incorporated into the system 
hardware. This is a “plug-in capability” that our agent-based approach can manage in the course 
of minimizing message traffic while maximizing effective throughput. Other algorithm specific 
application processors (ASEP’s) are commercially available. Agent’s provide an opportunity to 
utilize those off-the-shelf plug-in capabilities, as well as. 

Several domains such as embedded, real-time, and reactive systems are the application 
areas for which hardware / software codesign techniques are most beneficial. They are beneficial 
because the increasing complexity of advanced systems combined with technological 
advancement requires new design methods and integrated tool environments. 

Hardware / Software codesign is different from conventional approaches in that it 
continuously relates the hardware development cycle to the software development cycle. Hence, 
hardware decisions significantly affect software design activities and vice versa. In codesign, the 
entire problem is treated as a whole. The “co” means primarily together. However, it also 
expresses a design flow that is properly coordinated; a joint effort among designers from 
different areas. The effort is inherently concurrent in that the majority of all design steps are 
carried out in parallel by a team that guides, coaches, and supervises the design process. Ideally, 
several teams of experts develop system components rapidly and resolve problems in a timely 
manner. 
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C. SOFTWARE REQUIREMENTS - A FUNCTIONAL DISECTION LEADING TO A 
PROPOSED DESIGN FOR IMPLEMENTATION 



• Agent will not be specific to any particular digital network, but will be 

appropriate for any digital network where it can intercept and alter messages to 
and from all nodes on the network. 
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Agent will be a software agent in structure. 

Agent will be able to run under any common operating system. 

Agent, in the engineering design, does not require any particular computer 
language or programming style. 

Design & Architecture must support modular concepts of engineering and be 
adaptable to future enhancement / expansion. 

The design of Agent should be as general as possible within the constraint of 
performing within the tactical network environment and Link- 16. 

The software and design should be readily adaptable to major link component 
upgrades, as well as future systems and technologies. 

Agent control should be distributed rather than centralized. 

The Agent must be instantiated on every node in the link. 

The Agent must not require a special or central node for control or any other 
purpose. 

The Agent can continue to operate correctly with the loss of a number of 
transmitters or receivers. 



Functions & Operation 



Agent will be able to perform all known heuristics (e.g., delta messaging, update 
bundling, extrapolated updates, and so forth), and be extensible to other 
analogous techniques as they are developed. 

Agent must decouple its processing components from its knowledge of link rules 
and operations, so that a simple substitution of link rules and operations can 
enable it to work with other tactical links (e.g.. Link- 1 1, Link-4 A, Link-22). 

Instantiations of Agent will be able to communicate with each other in order to 
optimize their operations. 

Communications between instantiations of Agent should not be visible to host 
processing systems and the C2P (JTIDS, of course, will be aware of these 
transmissions). 

Inter-agent communication should not in itself constitute a significant burden to 
link operations (that is, should consume relatively little bandwidth or link 
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overhead). 



D. FUNCTIONAL REQUIREMENTS 



These requirements are specific to Link- 16, but many may apply to other digital 
networks. 

1. Agent will increase the effective bandwidth of one or more Link-16 NPGs. 

a) Agent will be able to measure the message loading within an NPG. 

b) Agent will be able to detect when the message loading reaches, or is about to 
reach, saturation. 

c) Agent will be able to selectively employ an heuristic or combination of 
heuristics to reducing the loading within an NPG (and, by so doing, increasing 
the effective bandwidth). 

d) Agent will rely on Time Slot Reallocation (TSR) to distribute the additional 
effective bandwidth to other purposes, including RTR messages. 

2. No “rules of the link” will be violated by Agent operations. 

3. Agent operations will degrade operations within an NPG to the minimum extent 
necessary to prevent or delay NPG saturation. 

a) Agent will employ lossless techniques whenever possible. 

b) Agent will employ lossy techniques only when necessary to avoid saturation. 

c) Agent will explicitly manage NPG degradation due to its operations. 

d) Agent will employ lossy techniques only when such losses are operationally 
acceptable. 

4. Agent will accommodate all current and planned users of J-Series information. 

5. Agent will not delay the transmission of J-Series messages beyond the point of 
military usefulness to the receivers within the tactical context. 

6. Agent will be able to function transparently on the bus that leads to or comes from 
the JTIDS terminal (when implemented in a JTIDS environment). 

7. Agent shall work when implemented on Link- 16 but not be specific to Link- 16. It 
should readily be reconfigurable to operate within other tactical links. 
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8. Agent will not fail or degrade if one or more nodes unexpectedly drop off the link. 
There will be no main or central node. 

9. Agent will be able to operate successfully in an environment with degraded 
connectivity (that is, one or more receivers fail to receive a complete message stream, 
probably due to loss of line of sight). 

10. Agent will be able to measure its own effectiveness and dynamically alter its 
operations in response to those measurements. 

11. Under no circumstances will Agent operations cause an erroneous J-Series message 
to be received at the host component. 

12. Agent will provide a complete tactical picture within militarily-acceptable intervals, 
which may be context sensitive. 

a) To platforms recently entering the link. 

b) To passive listeners. 
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E. KNOWLEDGE-BASED COMPONENTS 



1. General 

The knowledge-based components for Agent are especially designed for Agent 
operations. Many changes from the usual knowledge-based paradigms were necessitated by the 
need to do more than simply recognize situations, but also to perform actions in response to 
those situations. Therefore, this section is designed to describe the special properties of the 
knowledge-based components so that, during detailed design, appropriate tools can be properly 
designed (e.g., translators from KIF to Agent form, translators from Agent form to semantic 
networks, and the inference engine). 

2. Semantic Networks 

Facts are truth assertions used either to create or edit a semantic net, or provide the input 
to inferencing. They are also used by the metarule functions. It is assumed that the reader has a 
general knowledge of semantic networks, facts, production rules, inference engines, and other 
related terms used in artificial intelligence. 

a. The Semantic Network 

Underlying the facts and rules of Agent is the concept of the semantic network. This 
network is created within global memory available to any of the processes (and processors) of a 
single instance of Agent. The semantic network is created at agent initialization by the use of 
facts with an expanded grammar. For example, 

A message is_a null_class 

The value facet of the max_delay slot of the message is 25 ms 

The first statement creates a object called “message” and assigns it with an “is_a” 
relationship to the null class or object (which is created automatically in shared memory). The 
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second statement creates a slot called “maxdelay” within the message object. A component of 
the initialization process creates this semantic network in persistent store (using IBM’s ABE 
terminology). 

In the present design, this network cannot be added to during agent operations. Although 
“is_a” statements during initialization create classes, “is_a” statements in rule consequents create 
instantiations (or objects). 

The inference engine “looks up” values in the semantic net when antecedents reference 
slots in the semantic network, and “sets” values in the semantic net when consequents fire. 




Figure 19, Agent Semantic Network 



b. Slots and Objects 



( 1 ) NULL slots and the NULL obj ect. 

A special object is always present at the top of the semantic network, the null frame (or null 
class or null object). All classes and instantiations are subclasses of the null frame. It has many 
uses. 



It is awkward to always have to explicitly refer to a slot reference in the rules: 
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The <slot name> of the <object name> is <value> 

To simplify where possible, the <object name> always defaults to the null object. For 

example, 

The rules below use this construction often: 

IF Agent is operating 

This usage refers to the “Agent” slot of the null object, whose value might be 
“operating.” On the other hand, “The mode of Agent is operating” refers to the mode slot of the 
Agent object. 



(2) Slot References 
A slot reference is of the following formats: 

<ffame> 

<slot> [<ffame>] 

<slot> [of ANY <ffame>] 

<slot> [in ANY <ffame>] 

[THAT] <ffame> 

[THOSE] <ffame> 

If the name of the slot is omitted, then the default slot of the named frame is referenced. 
If the name of the frame is omitted, then the null object is assumed. So the following are slot 
references: 



1. The door 

2. The door of the house 

3. The door of ANY house 

4. THAT | THOSE door 

In the first case, since the name is omitted, the slot is part of the null object. In the second 
reference, the slot is part of the house frame. If both are used they reference different values. In 
the third form, the slot reference is to the set of all slots of all instantiated houses. In the last 
form, which is always paired with one or more ANY constructions, the reference is to all slots 
referred to by the matching ANY constructions. 

(3) Dynamic object and slot creation 

Note that these constructions do not create slots or objects in antecedents, but do create 
slots and objects in consequents. If there has been no statement such as 

THEN Agent is operating 
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there is no “Agent” slot in the null object when 

IF Agent is operating 

is encountered by the inference engine. Therefore the test fails and the slot is not created. 
(However, the slot could have been predefined and initialized during Agent initialization — the 
point is that it doesn’t have to be.) This kind of dynamic object and slot creation makes it much 
easier to write rules where perishable knowledge is involved, and, in fact, is the preferred way of 
writing such a rule. 

F. FACTS 



1. General Forms 



Facts in Agent are references to a semantic network. Facts consists of the following 

forms: 



<slot > [of <ffame>] declarative operation> <value> 

<slot > [of <ffame>] <relationship> <ffame reference> 

[<facet >]of <slot > [<ffame>] declarative operation> <value> 

The <ffame> refers to the class or object itself, the <slot > is the name of a slot within 

that frame. The <facet> refers to a facet within a slot. Items within square brackets are 

optional — when omitted, the reference defaults to the null object. Facet references are only used 

in building the semantic network, which takes place at Agent initialization time. 

Sample declarative operation: 

is 

Sample relationships: 

is_a 

has_a 



Below are a set of sample facts, with their forms in parentheses: 

The default_facet of the weight slot of the airplane frame is 32,000 lbs (3) 
The house is green (1) 

A house is_a building (2) 

A building has_a door (2) 
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Note that there are a number of words used for readability that are completely ignored by 
Agent, such as “the” and “a.” 

In a later section of this report, the use of facts in rules is explained and differences in 
syntax and semantics are noted. This section does not explain how the various forms of facts are 
used. It is often helpful to have plural forms of slot or frame names. When defining a semantic 
net, for example, the following form can be used: 

The house(s) is_a building(s) 

This creates a house frame with a synonym for house being houses. There is no meaning 
associated with plural forms — they are used precisely as singular forms. 

2. FactSets 

FactSets are collections of facts referenced by name. See under MetaLanguage. 

G. PRODUCTION RULES 



1. General Forms — Antecedents and Consequents 

The general forms of Agent production rules are as follows: 

1. <slot reference> <operation> (<value> [<units>] ) [ONLY | ALSO] 

2. <slot reference> <comparison> <value [<units>] ) [ONLY | ALSO] 

3. <slot reference> <comparison> <slot reference> [ONLY | ALSO] 

4. <directive> <slot reference> | <frame reference> 

2. Comparisons 



The following comparisons are defined in Agent: 

is - a positive statement of truth 
isn’t - a negative statement of truth 
is_greater_than - > 
is_greater_than_or_equal_to - >= 
is less than - < 
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is_less_than_or_equal_to - <= 
is_equal_to - numeric equality 
is_not - logical negatation 
is_identical_to - non-numeric equivalence 

is_the_same_as - non-numeric equivalence, less forceful than equality 
is_different_from - non-numeric non-equivalence, less forceful than is_not 

These all have meanings which are applied on a concete, and abstract formulation of 
comparisons. 



3. Antecedent format 



An antecedent appears in the following format: 

1. <slot reference> <comparison> <value> | <slot reference> 

2. There is <object > 

The following are examples of antecedents of the first form: 

IF The color of the house is green 

The color of the house is_the_same_as the color of the bam 
The mode is delta_messaging 
THAT message is_a J3.2 

These antecedents, when referenced, find the appropriate slot reference, perform the 
comparison, and return a truth value. It is important to note that they change nothing. In the last 
antecedent shown above, the “is_a” checks to see if any message signified by the THAT 
construction is instantiated as a J3.2 message (or instantiated from one of its subclasses). An 
object can have an “is_a” relationship to any number of classes. 

Each antecedent is implicitly connected by logical conjunction. However, an OR 
construction can be used when parentheses are added: 

IF (The color of the house is green OR 

The color of the house is red) 

The color of the house is the same as the color of the bam 
The mode is delta_messaging 

This form, when compiled, results in two separate rules with no OR construction. 

Antecedents of the second form are special. The parser, upon seeing “There is” beginning 
an antecedent, checks to see if there have been any instantiations of the object references, and 
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returns TRUE if there have been, FALSE otherwise. In addition, there is an implicit ANY 
involved, in that subsequent uses of THAT will refer to the instantiated objects. 



4. Consequents 



<slot reference> <operation> <slot reference> | (<value> [<units>]) 

The operations in consequents change the values of slots or create new relationships. The 

defined operations are as follows: 

is 

is_a 

has_a 

Consequents are executed in the order given. 



Sample Complete Rule 


IF 


(The color of the house is green OR 
The color of the house is red) 

The color of the house is_the_same_as the color of the bam 


THEN 


The house is desirable 

The price of the house is unknown ONLY 



This rule is provided to illustrate many of the bad things one can write in rules. For 

example, no specific house is referred to. Therefore they will fail since the do not refer to 

instantiations. The rule should have been written as follows: 

IF There is a house 

There is a bam 

(The color of THAT house is green OR 
The color of THAT house is red) 

The color of THAT house is_the_same_as the color of the bam 
THEN The house is desirable 

The price of the house is unknown ONLY 

In this version of the rule, the consequents are vague. Since, in the first consequent, we 
refer not to “TFIAT house” but, simply, “house,” the value “desirable” is concatenated (see 
below) with other values in the default slot of the house frame, if there is one. Since this is a 
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modification to declarative knowledge, all houses would default to “desirable” as a result. 
Similarly in the second consequent. Here’s another try: 

IF There is a house 

There is a bam 

(The color of THAT house is green OR 
The color of THAT house is red) 

The color of THAT house is_the_same_as the color of bam 

THEN The appeal of THAT house is desirable 

The price of THAT house is interesting ONLY 

Note that the first two antecedents identify the set of instantiated houses and instantiated 
bams. But the “color of the bam” reference will probably cause a problem since what we are 
probably interested in is the set of houses that have bams of the same color. So, finally, here is 
the form that seems correct: 

IF There is a house 

THAT house has_a bam 

(The color of THAT house is green OR 

The color of TFLAT house is red) 

The color of the bam of THAT house is_the_same_as the color of 
THAT house 2 

THEN The appeal of THAT house is desirable 

The price of THAT house is interesting ONLY 

Note that “has_a” relationships amount to a special type of slot reference (special in that 
all of the slots and other linkages of the bam frame are now part of the house frame). 



H. CONCATENATION IN THE RULES 

Multiple values can be assigned to the same slot simultaneously. 

THEN The house is green 

does not refer to any house in particular (it isn’t “THAT house is green”) and there is no 
object reference. Therefore a slot is created in the null object called “house” and its value is 
assigned to be “green”. (Slots are automatically created when referenced — they do not all have 
to be predefined in the semantic net creation step of agent initialization. They are never 
removed.) 

! Two uses of THAT in the same antecedent cause a pair-wise comparison. 
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THEN 



The house is green 
The house is big 



The above concatenates two values in the house slot of the null object, “green” and “big.” 
Therefore, 



EF The house is green 

succeeds, because “green” is one of the slot values. 

IF X 

THEN The track of that message is xxxxx 

The track of that message is yyyyy 



In this construction, the message referred to as “that” message now carries both track 
IDs. This is because the slots of the object 3 concatenate values assigned to them rather than 
replacing them by default. A query as to the track ID will match as long as one of the 
concatenated values match. 

If it is desired that only one value be present in a slot, there is a directive for that in the 
production grammar: ONLY. It looks like this: 

IF X 

THEN The track of that message is xxxxx 

The track of that message is yyyyy ONLY 



Of course, the assignment of xxxxx wouldn’t actually be in the real rule if ONLY was 
meant. When a consequent ends in ONLY then all concatenations are removed and the single 
value is assigned to the slot. 

The temporary message is where all operating processes/processors assemble a batched 
message. There needs to be only one of these, as each process/processor will contribute to a 
single output message. They only need to lock that message while they add to it. Also, if the 
message is long enough, or the oldest update has aged enough, the message becomes an output 
message and is deinstantiated as a temporary message. Any process with the authority of adding 
to the temporary message can make this switch. Also, this is how the exception handler outputs a 
message when it senses and exception condition. 



3 An “object” can either be a class (frame) or an instantiation of a class (frame) in this document. 
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ANY/THAT/THOSE CONSTRUCTIONS 



1. General 

When ANY appears in an antecedent, the inference engine creates a list of all objects that 
satisfy the specified conditions (if any). Thus, 

IF The status of ANY message is late 

will create a list of the internal identifiers of all late messages. The THAT (or THOSE) 

construction then refers to all items in that list. Therefore, 

IF The status of ANY message is late 

THEN The status of THAT message is output 

changes the status of every instantiation of message with “late” status to “output” status. 
If there are no objects (instantiations), then ANY fails. 

2. Subsets 



IF The status of ANY message is late 

The type of ANY of THOSE message is video 
The time of ANY of THOSE message is today 
RULE THOSE messages are no good 

In this instance, the THOSE in the consequent refers only to “late video messages 
generated today.” In this construction, the antecedents have order (each subsequent ANY creates 
a subset of the previous ANY). However, in the consequent, only the smallest subset is referred 
to. This is the only instance in the grammar where the antecedents must be ordered. Therefore it 
may take several rules to inference properly. 
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3. 



Evaluation Order 



A slot has one or more facets. Every slot has the value facet, which, unless initialized 
when the semantic net is initialized, is initially empty. However, the convention in Agent is that 
there are two other facets in every slot: the default slot and the if_needed slot. 

The default facet contains a value that is used if the value facet is empty. In general, rule 
bases to do alter the default facet. There is a value in the value facet and a rule causes that value 
to become empty, the inference engine will use the default value at its next opportunity, if there 
is a default value. 

Finally, an “if needed” function belongs to the third standard facet. If the first two facets 
are empty, and there is a function in the if needed facet, that function will return a value. So 
when referencing a slot, the following order is used: 

1 . value 

2. default 

3. if needed function 

If the rules writer knows that the if needed function need only be called once, the 
following construct can be used: 

The temporary_value is the cost of THAT message 

where the “cost” is calculated with an if needed function. Then “temporary_value” can 
be used until the cost needs to be recalculated. Using if needed functions is best when the value 
returned must be measured from a constantly changing world model. 

IF The distance from landing of the aircraft is greater than 20 miles 

In this case, each time the distance_from_landing slot is accessed, the if needed function 
provides the current answer. 
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J. 



DIRECTIVES 



Directives are special words which perform special actions in consequents of rules. They 
have a different syntax that ordinary consequents. Examples are DEINSTANTIATE, LOCK, 
UNLOCK, MOVE, INFER, etc. 

IF ANY message is late 

THEN DEINSTANTIATE THAT message FROM <frame> 

1. DEINSTANTIATE 

DEINSTANTIATE <object> [FROM <frame>] performs the opposite action of “is_a” in 
a rule consequent. Instead of creating an instance or object, it deletes it. I know of no standard 
inference engine which allows deinstantiation, but we have to have it in order to get rid of 
messages that no longer have any value to Agent processing. 

The part of the consequent that follows the DEINSTANTIATE is parsed according to the 
normal grammar. So that, in the above case, “THAT message” refers to the subset created by the 
antecedent use of ANY. 

If no FROM clause appears, the object is removed from all of its is_a links. If a FROM 
clause appears, the object removed only from the listed frame. Thus an object, such as a 
message, can be instantiated in a temporary manner to a class, instantiated to one or more 
additional classes, and then removed from the instantiation list of the original “is_a” assignment. 

2. LOCK/UNLOCK 

LOCK <shared memory> | <class> | <object> | <variable> 
prevents all access to a sub-tree within shared memory, or to a named variable. Although 

LOCK can apply to any sub-tree, it generally applies to the whole of shared memory (which is a 

sub-tree of the null object) or to the instantiations of one or more messages. When other 

processes use a construction that accesses or changes a locked sub-tree, they can either hang 
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until unlock or fail. Hanging until unlock is easier to handle and understand than failure, so the 
design will cause the other processes to pause until unlock. 

One tricky part of LOCK/UNLOCK not shown in the rules below is that after the lock 
succeeds the rule has to be repeated because the lock of some other process could have 
succeeded before any given lock takes place. Thus the conditions that prompted the lock could 
no longer apply. Thus, if I locked an aged entry message, I could very well find out that some 
other process had marked it for encoding (so it was no longer an entry message). In practice that 
means that those messages identified by the ANY in an antecedent are no longer part of the 
THAT/THOSE of the consequents after locking. Which is why, after locking, the rule has to be 
repeated. These repeats are not shown in the rule sets developed for the engineering design for 
the sake of simplicity, but will be part of the detailed design. 

3. NUMBEROF 

NUMBEROF <frame> returns the number of instantiations (not classes) associated with 
that frame or any of its decendents. Thus 
NUMBEROF messages 

returns all of the messages that have been instantiated in the semantic network below the 
“message” frame. 

NUMBEROF THOSE messages 

will return the number of message as subsetted by previous ANY clauses. 

4. NAMEOF 

NAMEOF is used to return the name(s) of instantiated object(s). There are times when it 
is important to refer to specific instantiations whose names may have been assigned and not 
stored. 

NAMEOF THAT message 

returns the true message name. NAMEOF is generally useful in conjunction with 
variables, as in 

name is NAMEOF THAT message 
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The status of Sname is empty 



If NAMEOF matches multiple names, then all are returned. 

5. INFER 

A rule base can initiate inferencing on another rule base, pausing until the inference 
engine concludes. This not only makes the rules easier to read, but also eliminates inferencing 
when it isn’t needed. When ENTER is completed, the facts instantiated by the call become part of 
the facts available to subsequent rules in the calling knowledge base. It looks like this: 

IF X 

THEN fNFER knowledge-base 

It is assumed that the facts needed for the named knowledge base are the ones available 
to the calling data base (although the named knowledge base can trigger a function that generates 
additional facts, if it needs to). The resulting facts simply become available to subsequent rules 
and are returned to the calling base (or to the metarule level) as the results of the inferencing 
process. 

6. PARALLEL 

The PARALLEL directive allows certain rules to fire concurrently with other rules if the 
system has multiple processors available to it. The use of PARALLEL is discussed in the section 
below concerning automatic rule sorting. 

K. SPECIAL TERMS 

There are other terms, such as NULL, MINUS, TIME/ and FIELD that are like 
directives except that they are part of the normal rule syntax. 
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1 . 



NULL 



NULL is used in facts and rules to erase any values assigned to a slot. 

The color of the house is NULL ONLY 

resets the value to the empty set. This usage is completely different and distinct from the 
null class to which all objects and instantiations are related. In general, if NULL follows “is_a,” 
it refers to the null frame. In all other forms it refers to an empty value. 

Implementation is automatic in that if no value is assigned to a slot called “null” in the 
null frame, the result will be to return a null value. Expressions that would set a value into a slot 
called “null” are disallowed. 



2. MINUS/PLUS/D IVTDEDBY (Slot Arithmetic) 



For example, in the construction, 

IF The status of ANY message is entry 4 

The TIME/ minus the time of THAT message is_greater_than the 
message jjrocessingtimelimit 

a slot reference appears on either side and a numeric value of the slot is expected. If the 

values in the slots are not numeric the antecedent simply fails. If they are numeric, then 

subtraction takes place and a value results . This value can be stored explicitly, as follows: 

The TIME/ minus the time of THAT message is the 
delay_time 

This expression puts the difference into the delay_time slot. 

The idea works for other embedded arithmetic operations on slots. There is however a 
different form of arithmetic in the rules, which is described below. 



3. FIELD 



FIELD (source or destination string, starting position, number of characters) 



4 These rules cover the problem of an entry message (left or right) that has sat too long without being processed 
(encoded or decoded). 
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FIELD represents a substring delimited by a starting position and a number of characters. 
The source can be an input buffer or a slot value. The starting position starts at 1 . If the number 
of characters field is set to zero, then the all available characters from the starting position to the 
end are matched and the number of such characters is returned in the third parameter. If the 
number of characters is set to a positive value, that number of characters is matched. Thus 
NCHARS = 0 

string of message is FIELD (left_input_buffer,l,nchars) 

transfers the entire contents of the input buffer to the string slot of the message frame, 

and returns the number of such characters to the nchars parameter. 

In Agent initialization, the following statements might appear: 

A messageformat is_a NULL_CLASS 
A J3.2 is_a message format 
The track-position of a J3.2 = is 14 
The track-length of a J3.2 is 22 characters 

This declarative knowledge, stored in KIF format, is persistent knowledge. That is, it is 
not normally changed by consequents of rules as they are fired by the inference engine. They are 
usually part of the invariant part of the semantic net in shared memory. (The inference engine 
has the capability of dynamically changing persistent knowledge, which means that it is capable 
of learning, although Agent does not employ this feature.) 

In this way, with the use of field, any message can be decoded from its character string 
format (raw format) to a set of slot values within an instantiated object. The knowledge of how 
to do this is external to the agent, so that the agent can operate on any set of messages without 
changing its internals. The only assumption is that the message is in string format and the field 
positions are fixed. However, a version of Agent can be built that can deal with variable format 
messages as well as long as sufficient information is present in the message to be able to 
determine field starting positions (and, possibly, lengths). 

4. NCHARS 

NCHARS, which takes no arguments, returns the number of characters in the left or right 
input buffer. 
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